

Tracking Most Significant Shifts in Nonparametric Contextual Bandits

Neural Information Processing Systems

We study nonparametric contextual bandits where Lipschitz mean reward functions may change over time. We first establish the minimax dynamic regret rate in this less understood setting in terms of the number of changes $L$ and the total variation $V$, both capturing all changes in distribution over the context space, and argue that state-of-the-art procedures are suboptimal in this setting. Next, we turn to the question of _adaptivity_ for this setting, i.e. achieving the minimax rate without knowledge of $L$ or $V$. Quite importantly, we posit that the bandit problem, viewed locally at a given context $X_t$, should not be affected by reward changes in other parts of the context space $\cal X$. We therefore propose a notion of _change_, which we term _experienced significant shifts_, that better accounts for locality, and thus counts considerably fewer changes than $L$ and $V$. Furthermore, similar to recent work on non-stationary MAB (Suk & Kpotufe, 2022), _experienced significant shifts_ only count the most _significant_ changes in mean rewards, e.g., severe best-arm changes relevant to observed contexts. Our main result is to show that this more tolerant notion of change can in fact be adapted to.
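The locality idea in this abstract can be illustrated with a minimal sketch: discretize a 1-D context space into bins and run a separate UCB rule per bin, so that reward changes affecting one region of context space do not disturb estimates elsewhere. This is only a hedged illustration of the locality principle, not the paper's algorithm; `reward_fn`, `n_bins`, and the UCB constants are hypothetical choices.

```python
import numpy as np

def binned_ucb_bandit(contexts, reward_fn, n_bins=8, n_arms=2, seed=0):
    """Hedged sketch: per-bin UCB over a discretized context space in [0, 1).
    `reward_fn(arm, x, t)` is a hypothetical mean-reward oracle; Gaussian
    noise is added to simulate stochastic rewards."""
    rng = np.random.default_rng(seed)
    counts = np.zeros((n_bins, n_arms))   # pulls per (bin, arm)
    sums = np.zeros((n_bins, n_arms))     # cumulative reward per (bin, arm)
    total_reward = 0.0
    for t, x in enumerate(contexts, start=1):
        b = min(int(x * n_bins), n_bins - 1)  # bin containing context x
        n = counts[b]
        # Empirical means; arms never pulled in this bin default to 0 ...
        means = np.divide(sums[b], n, out=np.zeros(n_arms), where=n > 0)
        # ... but get an infinite bonus, forcing at least one pull each.
        bonus = np.where(n > 0, np.sqrt(2 * np.log(t + 1) / np.maximum(n, 1)), np.inf)
        arm = int(np.argmax(means + bonus))
        r = reward_fn(arm, x, t) + rng.normal(0, 0.1)
        counts[b, arm] += 1
        sums[b, arm] += r
        total_reward += r
    return total_reward
```

Because statistics are kept per bin, a shift in mean rewards over one bin's region leaves the other bins' indices untouched, which is the intuition behind counting only locally _experienced_ shifts.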




Reviewer 1: 2

Neural Information Processing Systems

We greatly appreciate the feedback of the reviewers. We discuss the specific concerns of the reviewers below. We will include this discussion in the paper. We will include empirical results of a Gaussian-process-based bandit in the final paper. We will look into the techniques of Qian and Yang (2016) for adaptivity to the smoothness.



Self-Paced Deep Reinforcement Learning

Neural Information Processing Systems

In contrast, we propose to generate the curriculum based on a principled inference view on RL. Our approach generates the curriculum based on two quantities: the value function of the agent and the KL divergence to a target distribution of tasks.
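The trade-off between these two quantities admits a simple closed form worth sketching: maximizing expected value minus a KL penalty to the target task distribution, $\max_p \langle p, v\rangle - \alpha\,\mathrm{KL}(p \,\|\, q)$ over the simplex, yields $p_i \propto q_i \exp(v_i/\alpha)$. The sketch below is a hedged illustration of that solution, not the paper's full method; the temperature `alpha` and the function name are hypothetical.

```python
import numpy as np

def self_paced_task_distribution(values, target_probs, alpha):
    """Hedged sketch: solve max_p <p, values> - alpha * KL(p || target_probs)
    over the probability simplex. The optimizer is the exponential tilting
    p_i proportional to target_probs_i * exp(values_i / alpha), alpha > 0."""
    logits = np.log(target_probs) + np.asarray(values) / alpha
    logits -= logits.max()          # stabilize the softmax numerically
    p = np.exp(logits)
    return p / p.sum()
```

A small `alpha` concentrates the curriculum on tasks the agent currently values highly (easy tasks early on), while a large `alpha` pulls the distribution toward the target tasks, which matches the self-paced intuition in the abstract.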




Online Decision Making with Generative Action Sets

Xu, Jianyu, Jain, Vidhi, Wilder, Bryan, Singh, Aarti

arXiv.org Machine Learning

With advances in generative AI, decision-making agents can now dynamically create new actions during online learning, but action generation typically incurs costs that must be balanced against potential benefits. We study an online learning problem where an agent can generate new actions at any time step by paying a one-time cost, with these actions becoming permanently available for future use. The challenge lies in learning the optimal sequence of twofold decisions: which action to take and when to generate new ones, further complicated by the triangular tradeoffs among exploitation, exploration and $\textit{creation}$. To solve this problem, we propose a doubly-optimistic algorithm that employs Lower Confidence Bounds (LCB) for action selection and Upper Confidence Bounds (UCB) for action generation. Empirical evaluation on healthcare question-answering datasets demonstrates that our approach achieves favorable generation-quality tradeoffs compared to baseline strategies. From a theoretical perspective, we prove that our algorithm achieves the optimal regret of $O(T^{\frac{d}{d+2}}d^{\frac{d}{d+2}} + d\sqrt{T\log T})$, providing the first sublinear regret bound for online learning with expanding action spaces.
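The doubly-optimistic decision rule described in the abstract can be sketched in a few lines: select among existing actions conservatively via their LCB, but trigger generation of a new action when an optimistic (UCB-style) estimate of a fresh action's value, net of the one-time cost, beats the best existing LCB. This is a hedged illustration only; `gen_cost`, `gen_optimism`, and the confidence-width constant are hypothetical, not the paper's calibrated quantities.

```python
import numpy as np

def doubly_optimistic_step(counts, sums, t, gen_cost, gen_optimism):
    """Hedged sketch of an LCB-select / UCB-generate step.
    counts, sums: per-action pull counts and reward sums (each pulled >= once).
    gen_optimism: optimistic value assigned to an as-yet-ungenerated action.
    Returns "generate" or the index of the chosen existing action."""
    means = sums / np.maximum(counts, 1)
    width = np.sqrt(2 * np.log(max(t, 2)) / np.maximum(counts, 1))
    lcb = means - width                      # pessimistic value of known actions
    if gen_optimism - gen_cost > lcb.max():  # optimism about the unknown wins
        return "generate"
    return int(np.argmax(lcb))               # otherwise pick the safest action
```

The asymmetry is the point of the double optimism: pessimism (LCB) among known actions makes the agent demand strong evidence before settling, while optimism (UCB) about unseen actions keeps the generation option alive until its cost clearly outweighs the plausible gain.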